Phonetic decoding of continuous speech with the APHODEX expert system
نویسندگان
چکیده
In order to increase the accuracy of continuous speech acousticphonetic decoding, we started the APHODEX project some years ago. Our aim is to develop an expert system that implements the knowledge of an expert spectrogram reader, the phonetician F. Lonchamp. In the present version of the system, procedural and declarative approaches have been used jointly for the representation of phonetic expertise relating to : the segmentation of speech into phonemelike units, the classification of segments into phonetic categories (coarse !abeling) and subsequent interpretation in terms of phones (fine labeling). APHODEX is capable of processing hypotheses in parallel, using contextual analysis, fuzzy and uucertain reasoning, especially for the interpretation of cues and featureH extracted from the speech signal. Decoding results are represented in the form of a phonelattice; the description of alternative segmentations and labelings is possible. A prototype version of APHODEX has been implemented, using the SNORRl speech processing environment. Results concerning a multi-speaker corpus of continuous speech are presented and discussed.
منابع مشابه
Phonetic study for automatic recognition of Arabic
We propose in this paper a phonetic study of standard Arabic based essentielly on the spectrographic visions of 50 sentences of the DJOUMA corpus we have constituted. This study allowed us to determine the pertinent parameters of continuous speech recognition. For the recognition part, we present the algorithms developed and the results obtained for three important questions: The segmentation o...
متن کاملLandmark-Guided Segmental Speech Decoding for Continuous Mandarin Speech Recognition
In this paper, we propose a framework that attempts to incorporate landmarks into a segment-based Mandarin speech recognition system. In this method, landmarks provide boundary information and phonetic class information, and the information is used to direct the decoding process. To prove the validity of this method, two kinds of landmarks that can be reliably detected are used to direct the de...
متن کاملVisuo-phonetic decoding using multi-stream and context-dependent models for an ultrasound-based silent speech interface
Recent improvements are presented for phonetic decoding of continuous-speech from ultrasound and optical observations of the tongue and lips in a silent speech interface application. In a new approach to this critical step, the visual streams are modeled by context-dependent multi-stream Hidden Markov Models (CD-MSHMM). Results are compared to a baseline system using context-independent modelin...
متن کاملEfficient decoding strategies for conversational speech recognition using a constrained nonlinear state-space model
In this paper, we present two efficient strategies for likelihood computation and decoding in a continuous speech recognizer using an underlying nonlinear state-space dynamic model for the hidden speech dynamics. The state-space model has been specially constructed so as to be suitable for the conversational or casual style of speech where phonetic reduction abounds. Two specific decoding algor...
متن کاملA transcription-based approach to determine the difficulty of a speech recognition task
A new parameter for estimating the difficulty of a continuous speech recognition task, called speech decoding difficulty, is presented in this work. It is obtained from the language model defined for the recognition task and the phonetic similarity between the transcriptions of the words that make up the vocabulary used. Two variants of the proposed task difficulty measure are introduced: ideal...
متن کامل